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1. Introduction 

Let Xi , . . . , X n be a sequence of independent identically distributed random variables with a 
common Lebesgue density / : R — > R. Suppose that we observe random variables Yi, . . . ,Y n , 
n € N, given by 

Y j =X j + e j , j = l,...,n, (1) 

where £j are i.i.d. random variables, independent of (Xj) with the Lebesgue density f e . For r G 
(0, 1) the objective is to estimate the r-quantile of the population X given by 

/oo 
Pr{x - Tf)f(x) dx With p T (x) = x(t - fl{ xS C0}) (2) 
-oo 

from the observations Yi, . . . , Y n . Note that with the above definition of p T , a finite first moment 
of Xj would be necessary. To avoid this assumption, p T (x, rf) = t((x — r/) + — x + ) + (1 — t)((x — 
rj)- — X-), for x, 7] £ R, can be used instead of p T (x — rj) in (2) . 

Assuming that the distribution of the measurement error is completely known, Carroll and Hall 
(1988) have constructed a kernel density estimator based on the empirical characteristic function 
(p n (u) '■= — y,"_j e luY \u € R. In practice, however, the distribution of the measurement error is 
usually not known. Instead, we assume that we have at hand a sample from f £ given by 

el,...,s* m , meN. (3) 
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Motivated from applications, we will not assume that the observations (e^) are independent from 
(Yj). In particular, our procedure applies to the experiment setup of repeated measurements, which 
we will discuss below. 

We define the Fourier transform of g e U L 2 (R) by T g(u) := J R e lux g{x) dx for u € 

R. Based on the classical kernel estimator, Neumann (1997) has proposed the following density 
estimator for unknown error distributions 

fb{x) := T -[— — (x), (4) 

where (pa is the Fourier transform of the kernel K, b > is the bandwidth and the character- 
istic function of the error distribution ip s is estimated by its empirical counterpart </j e rn (u) := 
— X^feLi e m£fc , u G R. Obviously, f b depends on the sample sizes n and m which is suppressed in 
the notation. Applying a plug-in approach, our estimator for the quantile q T is then given by the 
quantile of the estimated density 



q Tfb = argminM 6 (?7) with M b {rj) := / p T (x - rf)f b {x) dx. (5) 
??e» J-oo 

Throughout the text we assume that f £ (u) /0,aeM. In particular, in this work we focus on 
error distributions for which the characteristic function decays polynomially in its tails. As shown 
by Fan (1991), these so-called ordinary smooth errors lead to mildly ill-posed estimation problems. 
They are mathematically more challenging than supersmooth errors. 

Although the literature on deconvolution problems is extensive, the problem of adaptive 
deconvolution with unknown measurement errors was only recently addressed. We refer to 
Comte and Lacour (2011); Johannes and Schwarz (2013) and Kappus (2012) for adaptive decon- 
volution of densities with unknown error distributions in the model selection framework. Minimax 
results and other properties for non-adaptive methods are given in Neumann (1997), Mcistcr 
(2004), Neumann (2007), Delaigle, Hall and Mcistcr (2008), Johannes (2009) among others. To 
the best of our knowledge, the problem of quantile estimation in deconvolution was considered 
only in Hall and Lahiri (2008). They constructed a non- adaptive quantile estimator for the case of 
known error distributions. Unlike the estimator (5), their estimator is based on directly inverting 
the distribution function estimator. Indeed, following the classical M-estimator analysis, the error 
of the quantile estimator (5) is directly related to that of the distribution function estimator (cf. 
the error representation (8) below). However, the analysis of the latter was not clear until recently 

Fan (1991) constructed an estimator for the distribution function by integrating the density 
deconvolution estimator. In order to perform an exact analysis of its variance a truncation of 
the integral was required in the estimation procedure. This resulted in a non-optimal estimation 
method for the case of ordinary smooth errors. Trying to circumvent this problem, Hall and Lahiri 
(2008) as well as Dattner, Goldcnshluger and Juditsky (2011) constructed a distribution decon- 
volution estimator based on a direct inversion formula for distribution functions. Applying the 
Fourier multiplier approach by Nickl and Reifi (2012), Sohl and Trabs (2012) have shown that the 
integrated density estimator can estimate the distribution function with y/n-r&te under suitable 
conditions. Still, they do not cover all cases where a parametric rate should be expected. So even 
with a known error distribution, a rigorous answer to the following question was left open: can the 
canonical plug-in estimator be used for minimax optimal distribution function estimation? Can 
quantiles be estimated optimally by the plug-in approach, too? 

To show that the answer to these question is 'y es \ we combine an exact analysis like the one by 
Dattner, Goldcnshluger and Juditsky (2011) together with abstract Fourier multiplier theory as 
in Sohl and Trabs (2012). This combination yields the required minimax optimal results for the 
distribution function and for quantiles. This closes the gap reported by Fan (1991) for distribution 
deconvolution in the ordinary smooth case. Moreover, we show that the minimax optimal rates 
still hold if the error distribution is unknown and has to be estimated, which is mathematically 
challenging. Since the deconvolution operator J 7 ^ 1 [l/ip E ] is not observable, we have to study the 
estimated counterpart J r_1 [(^^-(&u) / '(p £tTn (uj\. As a random Fourier multiplier it preserves the 
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mapping properties of the deterministic T~ 1 [1 /f e ] , but its operator norm turns out to be (slightly) 
larger. 

We can summarize our main contributions as follows: 

(i) The case of unknown error distributions is studied and minimax rates of convergence of 
the estimator q Tb are established. Unlike previous studies, we do not assume that the ob- 
servations (e£) arc independent from (Yj). Moreover, all results carry over to the case of 
deconvolution of distributions and quantiles under a known error distribution. 

(ii) Using the Lepskh (1990) method, an adaptive version of the estimator q T ^ is constructed 
whose convergence rates lose only a logarithmic factor compared to the oracle estimator. 

(iii) Compared to previous results, the conditions on the density / arc significantly weaker. In a 
natural way, the rates depend only on the local Holder smoothness of the density / around 
the true quantile and the decay rate of <p £ . Neither tail conditions nor global smoothness 
conditions on / are needed. 

The rest of this paper is organized as follows. In Section 2 we establish the minimax properties 
of the quantile estimator for both the known and unknown error distribution cases. The adaptive 
estimation is developed in Section 3. In Section 4 we apply our estimation procedure in simulations 
and a real data example. The proofs are postponed to Section 5. 



2. Convergence rates 
2.1. Setting and results 

Before we start with the error analysis, let us describe the class of densities we are interested 
in. Denoting (a) as the largest integer which is strictly smaller than a > 0, we define for some 
function g and any possibly unbounded interval I CM. the Holder norm 



l^,:=X><'»IU+ -y. (6) 



and with R > 



C a (I,R) := {g£C°(I)\\\g\\ Ca{I) <R}, 

where C°(I) is the space of all continuous and bounded functions on the interval I. Let J-(R) 
denote the set of all probability densities on the real line which are uniformly bounded by some 
constant R > 0. Throughout, we consider for finite positive constants i?, r, £ and the smoothness 
index a > the class 



C a (R,r,C) := {/G F(R) 



f has a r-quantile q T £ K such that 
feC a ([q T ~ C, q T +(},R) and f(q T ) > r}. 



Note that the quantile q T is unique given the assumption f(q T ) > 0. Taking the derivative in (5) 
and restricting to a growing interval, the estimator q^^ can be defined as solution of the estimating 
equation 

Q = M' b {ri)= ( f b {x)dx-T for some r,€[-U n ,U n ]. (7) 

J — oo 

Throughout, let ([/„) be a sequence of positive real numbers with U n — > oo and which are of 
the order log n. In view of finite computational time, any algorithm has to restrict to a bounded 
interval. Since q T is fixed for / € C a (R, r, £), the true quantile will be contained in this interval 
for n sufficiently large. Equation (7) illustrates the relation to the approach by Hall and Lahiri 
(2008) who invert the distribution function. However, without further assumptions fb will not be 
intcgrable in general. As a key result in our analysis we state the following lemma. In particular, 
it will be used to show that the integral in (7) is finite (cf. decomposition (11) below). 
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In the sequel we use the Landau notation O and Op. For two sequences A n ($) , B n ($) depending 
on a parameter A n (d) = Op(B n (d)) holds uniformly over a parameter set $ £ O if there is for 
all c > some C > such that sup^ ee P# > CB n (d)) < c. If A„ ($)/£?„ (#) converges in 
probability to zero, we write A n (d) = op(B n (-d)). 

Lemma 2.1. Let E[|e£| 4 ] < oo and assume for /3 + > f3 > 



I^HI-^Oftl + M)") and |^(«)I=0((1 



as well as mc 



lb 2{S+l 

for any s ^ and for any f € C s+ ^ 
■tp K {bu) 



Then there exists a random variable £{, 



7"" 



f 



tp K (bu) 



o P (i V 



^ £ b\\f\\ C s+P+< 



such that 



To prove this lemma, we will show that the linear map C s+I3 (R) 9 / i-» 
J- 1 [<fK{bu) F f(u)/(p Sl m(u)] € C S (R) is bounded. More precisely, the operator norm of the ran- 
dom Fourier multiplier LpK{bu) / Lp £tm (u) can be bounded by the random variable Eb- The condition 
on the derivative (p' £ is natural in the context of Fourier multipliers. 

Given the assumptions of the lemma the right-hand side of (7) is finite and of course it is 
continuous in rj. Hence, the estimating equation always has a solution for U n large enough. It does 
not have to be unique since /& is not necessarily non-negative. Nevertheless, any choice converges to 
the true quantile, assuming the latter is unique. Recalling that we write ip £ := T f £l the conditions 
in Lemma 2.1 motivate the definition of the class of error densities f e 

2^(i?, 7 ) := {fe € 7(cx))|i(l + \u\)-P < \Tf £ (u)\ < R(l + \u\)^, 

\(Tf £ )'(u)\ ^ R(i + ny 1 -?, \\x*f c (x)\\ L i < r} 

for some moment 7 > and we use the same constant R as above for convenience. The upper 
bound for |</5 E (w)| in T>P(R,j) is only necessary for the adaptive estimation. 
We will need some properties of the kernel to construct our estimator. 

Assumption 1. Let the kernel K € L 1 (R) satisfy 

(i) supp TK C [—1,1] and 

(ii) K has order N, i.e., for k = 1, . . . £ 

K(x)dx = l, [ x k K{x) dx = and [ \K(x)\\x\ e+1 dx < 00. 



To control the estimation error of q Ty b, we follow the classical M-estimation approach, or more 
precisely the Z-estimator approach (cf. van der Vaart (1998)). Let M(rf) be the deterministic 
counterpart of Mb (rj) defined in (5). The quantities q Ty b and q T are given by the roots of the 
derivatives M' h and M', respectively. From the Taylor expansion = M^qr^) = M^{q T ) + (q T ,b — 
q T )MH{q*) for some intermediate point q* between q T and q T ,b, we obtain 



Qr,b 



J_ T 00 (A(x)-/(x))dx 
Vb(q*) 



(8) 



Hence, the error q T ^ — q T is affected by the numerator and the denominator in this representation. 
We now present two propositions that deal with these two terms. These propositions are intrinsic to 
our analysis, but may also be of interest of their own. The first proposition deals with the numerator 
in (8) and essentially establishes minimax rates of convergence for distribution deconvolution with 
unknown error distributions. All convergence rates are determined by the minimum of the sample 
size n of the observations Yj and of the sample size m of the observed errors e* k . Therefore, we 
suppose n ^ m throughout. Note that the quotient in (8) might explode if fb(q*) becomes very 
small for large stochastic error. Hence, we establish convergence rates as Op results. 
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Proposition 2.2. Suppose that Assumption 1 holds with £ = (a) + 1 and let 6* = 

7 - i - 1 /(2a+2(/3vi/2)+i) _ Then for any a 1/2, /3,ii,r,f > and 7 4 we /lave uniformly over 
f £ C Q (i?,r,C) and / £ e V^(R,j) as n -> 00, 



(/bi(x)-/(a ; ))dx =0 P (V„(a,/3)), 



where 



(log n/n) 1 / 2 , 

n -(Q+l)/(2a+2/3+l) 



/or /3e (0,1/2), 
/or /3 = 1/2, 
/or/3 > 1/2. 



(9) 



In view of the lower bounds stated in Fan (1991), as long as n < m, the rates above are optimal 
and estimating the distribution function by integrating a density deconvolution estimator is a 
minimax optimal procedure. Hence, this proposition closes the gap reported in Fan (1991) and 
further extends the results to the case of unknown error distributions. In the studies of density 
estimation with unknown error distribution, for instance Neumann (1997); Johannes (2009), the 
sample size m of the error can be of smaller order than the number n of the observations 
(Yj) to obtain optimal rates. This is because the risk of estimating of (p E profits from the decay of 
the characteristic function of Xj . Assuming local regularity on / only, its Fourier transform will 
not decay fast in general such that this effect does not occur. Consequently, it is natural that the 
minimax rates are indeed determined by n A m (cf. the analysis of (27) below). 

Next we would like to understand better the denominator of (8). Lounici and Nickl (2011) 
proved uniform risk bounds for the deconvolution wavelet estimator on the whole real line. How- 
ever, on a bounded interval, which is sufficient for our purpose, uniform convergence of the decon- 
volution estimator ft, can be proved more elementarily. Note that with b„ = (logn/n) 1 '^ 2a+2 ^ +1 ' 
the following proposition yields the minimax rate (logn/n) a /( 2Q+2 ^ +1 ) of the L°°-loss. 

Proposition 2.3. Grant Assumption 1 with £ = (a) . For any a, ft, R 1 r, £ > and 7 ^ we have 
uniformly over f £ C a (R, r, £) and f e £ T>^(R, 7) as n — > 00, 



sup \fb(x + q T ) 
*e(-C,C) 



f(x + q T )\ = Op(b a + 



logn x 1 / 2 



j 6 2/3+] 



In particular, ifb n — > and nb 2 f +1 / logn — > 00 as n — > 00, fb n is a uniformly consistent estimator. 

The density deconvolution estimator is then locally uniformly consistent. The two propositions 
above are the building blocks for the first main result of this paper established in the following 
theorem. 

Theorem 2.4. Suppose that Assumption 1 holds with £ = (a) + l. Letq Tl b be the quantile estimator 
defined in (5) associated with b* n = ri -V(2a+2(^vi/2) + i)_ Then j or any a ^1 1/2, f3, R,r,( > and 
7 4 we have uniformly over / £ C a (R, r, £) and f E £ T)P(R, 7) as n — > 00, 



q T \ = P (i> n {a,p)) 



where ^„(a, /3) is given in (9). 



We finish this subsection by providing the minimax rates for estimating the distribution function 
and the quantiles for the case of known error distributions. As above the estimators are given by 
plug-in, using the classical density estimator 



fb(x) 



ip n (u)lfi K (bu) 



(x), X £ 



(10) 



Due to the known (p s the mathematical analysis is simpler and thus we need weaker assumptions. 
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Corollary 2.5. Suppose that the error distribution is known. Let Assumption 1 hold with I = (a) + 
1. Let q T jy be the quantile estimator based on the density deconvolution estimator (10) associated 
with 6* = rt _1 /( 2 "+ 2 (^ vl / 2 )+ 1 ) . Then for any a, (3, i?, r, £ > and 7 ^ we obtain uniformly over 
f e C Q (i?,r,C) and f £ € V^{R,j) as n -> 00, 

' T (A- (*)-/(*)) da; =Op(V„(a,^)), 

-OO 

|?r,6* -9t| =0 P (ip„(a,p)), 

where ift n (a,/3) is given (9). 

We want to stress again that the only global condition on the density / is uniform bound- 
edness. A tail condition similar to the one of Fan (1991) is not necessary In contrast to 
Dattner, Goldcnshluger and Juditsky (2011), the smoothness is measured locally in a Holder scale 
and not globally by decay conditions of the Fourier transform of /. The former is more natural 
since both, the distribution function and the quantile function are estimated pointwise. 

2.2. Discussion 

Although we do not provide the lower bounds for quantile estimation, it is clear that the above 
rate is the optimal. It is the same rate as the one achieved in Proposition 2.2 which deals with 
the estimation of the distribution function. A better rate cannot be expected (cf. the case of no 
measurement errors). 

In order to prove Proposition 2.2 and thus Theorem 2.4, we apply a smooth truncation function 
a s to decompose the error into 

?T (fb(x) - /(*)) dx = f T (K b * f(x) - /Or)) dx + a s (x + q T )(f b {x) - K b * f(x)) dx 



deterministic error singular part of stochastic error 

+ (1 - a s )(x + q T )(f b (x) - K b * f{x)) dx (11) 



continuous part of stochastic error 

with the usual notation Kb(-) = b^ 1 K(- /b). The function a s can be chosen such that it has compact 
support and satisfies (Ih-o^oi — a s ) s C°°(K). Similar to the classical bias- variance trade-off, the 
deterministic error and singular part of the stochastic error will determine the rate. The continuous 
part, however, corresponds to the estimation error of a smooth (but not integrablc) functional of 
the density. If the error distribution would be known, it would be of order n -1 / 2 . For unknown 
errors we use Lemma 2.1. 

Dealing with unknown error distributions leads to the requirement of 7 4, i.e., that f s 
possesses at least 4 moments. In view of the analysis by Neumann and Reifi (2009) this assumption 
also implies uniform convergence of f e ,m- As seen in Lemma 2.1, our estimate of the operator 
norm of the random Fourier multiplier !F~ [tfK ibu) / ip e . m {u)] is of order Op(l V (n~ 1 / 2 fr~' 3 ~ 1 )). 
This might be larger than the operator norm of the unknown deconvolution operator 1 [1/fe (u)] 
which is uniformly bounded. Yet, for a ^ 1/2 the additional error that appears in the continuous 
part of stochastic error in (11) is negligible. More generally, the continuous part is of smaller or 
of the same order as the singular part whenever n~ 1 b~ 2 ^ A1 ~ 2 is bounded. 

Using the known (p £ , we do not need to estimate the deconvolution operator and thus there is 
no additional error. Consequently, we do not need a moment assumption on the error distribution 
and the continuous part of the stochastic error in (11) is of order n -1 / 2 implying that the minimax 
convergence rates hold true for all a > 0. 

Although the focus of this paper is on ordinary smooth error distributions, it is worth mentioning 
the following. If the errors have a supersmooth distribution, that is characteristic function ip e 
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decays exponentially fast, similar results can be obtained. Let us sketch this case briefly. To get 
an analogous result as Lemma 2.1, suppose that E[|e^| 4 ] < oo and for some (3 > and 70 ^ 71 > 

\ips{v)\- x = 0(e*>W) and \ip' e {u)\ = 0(e-" fl ^), «el. 

Then a similar argument as in the proof of Lemma 2.1 shows for sufficiently small c, 7 > and for 
the bandwidth b* n = c(logn)^ 1 /' 3 



ipi<(b* n u) 



f 



<^ll/llc.(R) where £ b = O p (l V e>* ") 



for any s ^ and for any / G C S (R). In other words, ipxibu) / Lp e _ m (u) is a random Fourier 
multiplier on Holder spaces with exponentially increasing operator norm. Following the lines of 
the proof of Proposition 2.2, one sees that the singular as well as the continuous part of the 
stochastic error in (11) are of the order Op(n~ 1 / 2 e yb ). Combined with the estimate for the 
deterministic error, the choice b* = c(logn)~ 1 / /3 yields for / G C a (R,r,C) as n — > 00, 



(f K (x)-f(x))dx =0 P ((logn) 



Note that this is the minimax rate for distribution function estimation as given in Fan (1991). 
Therefore, also for supersmooth error distributions the integral domain does not need to be trun- 
cated to estimate the distribution function via the plug-in approach. 



3. Adaptive estimation 

The choice of the bandwidth b is crucial in applications. Therefore, we develop a fully 
data-driven procedure to determine a reasonable bandwidth. We follow the approach which 
was originally stated by Lcpskh (1990). More precisely, we use the version proposed in 
Goldcnshluger and Nemirovski (1997). 

To this end, we consider the family of estimators {q T ,b, b € B n } where q T ,b is defined in (5) and 
B n is a finite set of bandwidths. Typically, the band widths are chosen geometrically growing. In 
view of the error representation (8) it is important that fb(qr,b) is a consistent estimator of f(q T ) 
for all b G B n . Therefore, conditions on the bandwidth as in Proposition 2.3 are necessary for the 
whole set B n - Moreover, we keep to the assumption a > 1/2 such that the additional error due 
to bounding the random Fourier multiplier is negligible. We will prove convergence rates for the 
adaptive procedure under the following 

Assumption 2. Let the set B„ := {b nj j,j = 1, N n } consists of a monotone increasing sequence 
of bandwidths such that b n _j + i/b n _j is uniformly bounded in j = 1, . . . , iV n and n 1. For n — > 00 
suppose 

N n S l°gn, (\ogn) 2 b ny N n — > and nb n ^ 2 — > 00. 
Moreover, the optimal bandwidth b* n = 7i _1 ^ 2Q+2 ^ vl / 2 ) +1 ) has to be contained in the interval 

[bn,l,bn,N n ]- 

Obviously, Assumption 2 depends on the true but unknown degree of ill-poscdncss j3. Note that 
in our case the lower bound for the bandwidth is not determined by the variance of the quantile 
estimator itself but by the variance of the density estimator and the minimal smoothing which 
results from a > 1/2. Inspired by Comte and Lacour (2011), we propose the following construction 
of a feasible set B n . 

Lemma 3.1. Defining for A„ := {1, l/y/2, . . . , \/y/n} 

~ ( 1 /lognx 1 / 2 f 1 ^ 1 

b n ,min ■= min |& G A„ : - < \~~^~) J We^niu)^ 1 du ^ 1 j (12) 
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and L n := (b n ^ min (\ogn) 3 ) 1 /i°g™ } \ e t B n be given by 

N n = [lognj and b„.i = b n , min , b n ,j = L j n b n ,i, j = 2,...,N n . 

Then, B n satisfies Assumption 2 for all f G C a (R, r, £) and f e £ T>" (i?, 7) with a > 1/2 and (3 > 

Given the bandwidth set, the adaptive estimator is obtained by selection from the family of 
estimators {q T ,b, b £ B n }- As proposed by Lepskh (1990) the adaptive choice should mimic the 
trade-off between deterministic error and stochastic error. The adaptive choice will be given by the 
largest bandwidth such that the intersection of all confidence sets, which corresponds to smaller 
bandwidths, is non-empty. As discussed above it is sufficient to consider the singular part of the 
stochastic error in (11) only. To estimate the variance of q T> b corresponding to the latter, we define 
for some S > 



~ (y/2 + <5)Vloglogn maxn^b c^x + (<51ogn) 3 max M ^ b 07, 

Ijb '■ = ^r— 

\fb(q.r,b)\ 

with the truncation function a s from decomposition (11) and 



(13) 



'b,X 

11- 

i- 

b ' e 47r 2 r 



1 A/f° , . , v(p K (bu)e iuY n , „ , , \2 



i/b 



1/6 



Vn{u) 2 f 1/b .Fa s (u) 



VK{bu)\ — — du / \ip K (bu)\ 



2 



du. 



Note that we apply a monotonization in the numerator of by taking maxima of o~fi,x and o^^. 
With Tib at hand the adaptive estimator is given according to the following rule. Define for any 
beB n 

U b :=[q T ,b-'Eb,qT,b + 'Eb}- (14) 

The adaptive estimator is given by 

Qr-=q T r K with 6;:=max{6eS„| f| U^tt}. (15) 

Note that b* n is well-defined since the intersection in (15) is non-empty for b = b n ^\. The following 
theorem shows that this estimator achieves the minimax rate up to a logarithmic factor. As 
usual the proof relies on a comparison with an oracle-type choice of the bandwidth. However, all 
ingredients have to be estimated and without assuming independence of Yj and e£, which requires 
special attention. 

Theorem 3.2. Grant Assumptions 1 and 2 with I (a) + 1. Then for any a > 1/2, /3, R, r, £ > 
and 7^4 the estimator q T as defined in (15) satisfies uniformly over f G C a (R,r,£) and f E £ 
VP(R,j) as n -> 00, 

\q T - q T \ = 0p((Vloglogn+ ( log n S f)^ n (a, /?))), 

where i/) n (a,f3) is given in (9). 

Since Yj and e* k are not independent, we have to bound the stochastic error of q T ^ with use of 
the Cauchy-Schwarz inequality to separate the error terms. To estimate the remaining factors, we 
take into account the term (logn 5 ) 3 . If the error density were known, this term would not appear. 
The y/log log n is the additional loss for Op-adaptivity which seems unavoidable. 

To estimate the distribution function, a very similar procedure can be employed because the 
bias- variance trade-off is determined by the estimation of the distribution function anyway. The 
only difference will be to replace the denominator of Sfc by 1/2 which results in 

(2a/2 + 5) \fk>g lognmaxc^.x + (Slogn) 3 max 07^, 

fi^b P^b 

for some S > 0. The proof for an analogous result as Theorem 3.2 follows easily. 
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Table 1 

Empirical root mean square error (RMSE) of the adaptive and naive (in parenthesis) estimators for estimating 
q T for t = 0.25, 0.5, 0.75, based on 500 Monte Carlo simulations with n = m = 1000. 



fc = l,/3 = 2 fc = 2,/3 = 2 k = l,P = 4 fc = 2,ff = 4 

t = 0.25 

RMSE 0.279 (0.296) 
r = 0.5 

RMSE 0.233 (0.170) 
r = 0.75 

RMSE 0.453 (0.541) 



4. Numerical results 

4-1- Simulation study 

We now illustrate the implementation of the adaptive estimation procedure of Sections 3. Our 
small simulation study serves as a proof of viability of our proposed method. 

We run 500 Monte Carlo simulations for four experimental setups. The sample size is set to 
n = 1000 and the external sample of the directly observed error is set to m = 1000 as well 
(here the external sample is independent of the main one). We consider r(l,l) and T(2, 1) for 
the distribution of X where T(k,rj) denotes the gamma distribution with shape parameter k and 
scale r\ such that its density is (T(a)rj k )~ 1 x k ~ 1 cxp{ — x/rj}i^ x ^. ^. Note that the shape k of the 
gamma distribution determines the global smoothness of the density while our convergence rates 
depend on local smoothness a which may be larger. For the error distribution we consider standard 
Laplace distribution (f3 = 2) and the convolution of a standard Laplace with itself (f3 = 4). 

The target quantiles of interest are q T with r = 0.25, 0.5, 0.75. A kernel with flat-top Fourier 
transform was chosen for the estimator (see e.g., McMurry and Politis (2004)) and the adaptive al- 
gorithm was implemented on a geometrically growing grid. Usually, applying this adaptive scheme 
requires an additional tuning of the algorithm (see e.g. Spokoiny and Vial (2009)). However, for 
the specific setup considered here the adaptive procedure seems to be robust and was implemented 
exactly as defined in (14)-(15). In the real data example in the next subsection we compare the 
adaptive estimator to the " naive" quantile estimator given by the inverse of the empirical distribu- 
tion function of the observations Y. Thus we also applied the naive estimator in our simulations. 
The results of this simulation study are given in Table 1. We can see that the results support 
the theory - the empirical root mean squared error (RMSE) is higher for /3 = 4 than for /? = 2. 
Also, we can see that the RMSE is lower for k = 2 than for k = 1 since the gamma distribution 
with larger shape parameter is smoother in our context. For median estimation the naive seems 
to perform quiet well in this experimental setup. 

4-2. Real data example 

High blood pressure is a direct cause of serious cardiovascular disease (Kannel et al. (1995)) and 
determining reference values for physicians is important. In particular, estimating percentiles of 
systolic and diastolic blood pressure by sex, race or ethnicity, age, etc. is of substantial interest. 
However, blood pressure is known to be measured with additional error which needs to be ad- 
dressed in its analysis (see e.g., Frese, Fick and Sadowsky (2011)). Therefore, measurement errors 
should be taken into account, otherwise quantile estimates based on the observed blood pressure 
measurements would be biased. 

We illustrate our method using data from the Framingham Heart Study (Carroll et al. (2006)). 
This study consists of a series of exams taken two years apart where systolic blood pressure (SBP) 
measurements of 1,615 men aged 31 — 65 were taken. These data were used as an illustration 
for density deconvolution by Stirnemann, Comte and Samson (2012) and for distribution decon- 
volution by Dattner and Reiser (2013). We denote by Yj t \ and Yj y2 the two repeated measures of 



0.146 (0.239) 
0.187 (0.143) 
0.245 (0.436) 



0.295 (0.651) 
0.246 (0.239) 
0.470 (0.923) 



0.169 (0.516) 
0.202 (0.204) 
0.256 (0.761) 
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Fig 2. Distribution function estimation (left) and quantiles estimates (right) for systolic blood pressure of 1,615 
men aged 31 — 65 from the Framingham Heart Study. Solid line for the adaptive estimator and dashed line for the 
naive estimator. 
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SBP for each individual j at two different exams and denote by Xj the long-term average SBP of 
individual j. Then we assume that 

Y j,i = X o + S 3,U Y j,2 = Xj + s ji2 , 

for individuals j = 1, ...,n. Following Carroll et al. (2006), we use the average of the two exams 
Yj = (Y^i + Yj, 2 )/2, so that the model in our case is 

Yj = X j + e f j , (16) 

where s'j = (e,u + £j,2)/2. 

Taking advantage of the repeated measures setup, we can avoid parametric assumptions regard- 
ing the distribution of the errors. The only assumption we will make is that the distribution of the 
measurement error is symmetric around zero and does not vanish. We then set e* = (ij.i — Yj^)j1 
and note that under the symmetry assumption it is distributed as Ej. We emphasize the fact that 
our theoretical results do not require that the sample e* will be independent from that of the Yj. 
Thus, we estimate the characteristic function of the error e' by 

^ m 

Ve',m(u) = — exp(ui£*). 

3=1 

Histograms of Y' and e* are presented in Figure 1. The resulting adaptive distribution and quan- 
tiles estimates are displayed in Figure 2. With the adaptive chosen bandwidth, the Fourier trans- 
form of the kernel function Kb is supported on [—0.1,0.1] for both distribution and quantilcs 
estimation. It seems that there is no much difference between the naive and adaptive estimates 
for smaller quantiles. However, the observed differences are larger for larger quantiles. Although 
further analysis of the data may and should be done for inference purposes, we do not pursue them 
here since our goal is merely to show the applicability of the adaptive procedure to real data. 

5. Proofs 

5.1. Proofs for Section 2 

In the sequel we will use the deterministic Landau symbols O and o. For convenience we will 
write A n ($) < B n ($) if A n (i?) = 0(B n ('d)) as well. For a better readability we throughout assume 
P 7^ 1/2. In the special case (3 = 1/2 the order of the stochastic error will be (log n/n) 1 / 2 which 
can be easily seen below in the bounds (28) and (30). For the sake of clarity of our arguments 
we distinguish between n and m in this section. However, all rates are governed by n A m. The 
subscript n at the bandwidth will be omitted. 

Since 1 / ip e , m might explode for large stochastic errors we need the following lemma. The moment 
assumption on f e corresponds to the condition 7 > 2 on the set T>p(R, 7). 

Lemma 5.1. Suppose E[|e^| 2+l5 ] < 00 for some 8 > 0. Let T m — > 00 be an increasing sequence 
satisfying m 1 / 2 inf ue ^_ Tm ^ 1 |<p E (u)| > (logT m ) 2 , then for any p < 2 

P{ iuf \tp e , m (u)\ < m" 1/2 (logT m ) pN ) = o(l) as ?7Woo. 

\«e[-T m ,T m ] / 

Proof. The triangle inequality, the assumption on T m and Markov's inequality yield for some 
constant D > and for m as well as T m large enough 

P( inf be,m(u)| <m- 1/2 (logT m ) p ) 

^P(3u e [-T m ,T m ] : \<fe(u) - <p e , m (u)\ > \<p e (u)\ - mT 1 ' 2 (log T m ) p ) 
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<P( sup \ip E {u) - <p e , m (u)\ > inf \(p e (u)\ -m- 1 / 2 (logT m )f) 
<P( sup m l ' 2 \ Ve (u) - <p E , m {u)\ > ,D(logT m ) 2 - (logT m ) p ) 

\.,el-T T 1 ' 



2 

< ]R 

^(logT m ) 2 



sup m 1/2 |v? e (-u) - ip s , m (u)\ 

ue\-T m ,T m ] 



Noting l[_T ra ,r m ](«) < w(u)/w(T m ) for iy(u) := (log(e + |w|))~ 1/2 ~'' for some r/ G (0,1/2), the 
above display can be bounded by 



E 



S upm^ 2 w{u)\ip E (u)-ip E , m (u)\ < (logT m yV 2+r i (17) 



Dw(T m ){\ogT m y 

where the expectation is bounded by applying Theorem 4.1 in Neumann and Reifi (2009). □ 

To ensure consistency of the density estimator, we have to assume frequently (nAm)b 2l3+1 —> oo. 
Since this implies m 1 / 2 inf u e[— 1/6,1/6] |Ve(w)| ^ I l°g b\ 2 f° r / G T>P(R, 7) and any bandwidth which 
increases polynomially in n, Lemma 5.1 can be applied to T. m = 1/6. Hence, under this assumption 
the probability of the event 

Bs(b) :={ inf l^^l^m-^llog&l 3 / 2 } (18) 

l. «G[— 1/6,1/6] J 

tends to one. Therefore, it is sufficient to control terms on B £ (b) only. Frequently, the weaker 
estimate |y £ , m (w)| to -1 / 2 for |it| ^ l/b will be enough. 

5.1.1. Proof of Lemma 2.1 

Note that the assumptions on <p E imply |(^3~ 1 )'(it)| < (1 + M)* 3-1 as well as |(p7 1 (u)| < (1 + 
)' s , <G K. On these assumptions Sohl and Trabs (2012) have shown that (1 + iu)~" /(p E (u) is 
a Fourier multiplier on Besov spaces. Due to the regularization with the kernel, (px (bu)/fe,m {u) 
behaves basically like l/(p E and thus we can derive mapping properties of the random Fourier 
multiplier 

^):=(1 + * U )^^M, «E1. 

<Pe,m{u) 

On B E (b), as defined in (18), we will check Hormander type conditions and derive an upper bound 
for the operator norm of ip(u). More precisely, we apply Corollary 4.13 by Girardi and Weis (2003) 
with p = 2, I = 1. Hence, we have to determine a suitable constant > satisfying 



le{0,l} V J [-2,2] 



max \^ l) (u)\ 2 duj ^Af and 



max sup T l - 1/2 ( I {u)\ 2 du] ^ Ay. 

le{0S} TG[0,oo) WtsC|u|^4T 



(19) 



To find A^p, we note that 



1 ^r^ + ^'^-^S , forp e{ l,2} (20) 



\<Ps,m(u)\ P \Vs(u)\P \Ve{u)y E , m {u)\P 

and thus on B E (b) 

1 1 + A m (u) to 1 / 2 1 , s x x. 



^ \<p E ( U )\ ' " IV |l0g&|3/ 2 ' 
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m ^ \(p K (bu)\(l + A m (u)) 
W }l ^ (l+uY /2 \<Ps(u)\ ^( 1 + A '"( M )) a [-i/M/fe](")- 
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(21) 



Concerning the derivative, we estimate b ^ 2(1 + 1 u\ ) 1 for \u\ ^ 1/b and b < 1/2 and consequently 
by I^MMMISU + M)- 1 



|V'(u)| < (/3 + l)(l + U 2 )-( /3+1 )/ 2 
+ (1 + U 2 )-^ 2 



ipK(bu) 



<p' K (bu) 



<JWL + l^( ti) l 



1 + u 



¥>e,m(«) Ve,m(w) 
<^r.m( M ) 



<Pe,m(u) 



< 



1 + A m (u) A | ye,m( M ) geW 



1/6,1/6] ( U ) 



< 



< 



(l + A m ( M )) 2 ^ |^. m W-^(u)| ^ , , 

i + N I 1 + ' WM\ n-wmM 



i + a 2 ^) 

l + |u| 



(l + (1 + \u\y +1 W e , m {u) - ^)|)l h i/M/6]M- ( 22 ) 



With these bounds at hand we can show now (19). For I = the estimate (21) and 1/T < (l + |u|) 1 
for \u\ < 4T yield 



|^(«)rdu< / (l + A^(u))ll [ _i /6il/6 ](u)du ) 



T^|m|^4T 



|^(w)| 2 du 



< 



1 



T 



(! + A m( M )) ]1 [-l/6.1/6] ( u ) du 

T<|m|^4T 
,1/6 

<i+/ (i + H)X(«)<i«- 

J-l/6 



Hence, the conditions (19) for ^ = are satisfied for of the order 

/ f X / b \ 1/2 

(i + J ^{i + nr'Ai^Au) 

For / = 1 we verify by (22) and T < (1 + M) for |u| > T 



W(u)\ 2 du< / (l + A^( U ))(l + (l + | M |) 2 ^ 2 K m ( U )~^( W )| 2 )d U and 



T 



< 



T^|«|^4T 



iv'Hrdu 

Td u [ l ' b f Af n ( 



< 1 



(1 + M) 2 J_ 1/b \l + \u\ 

1/6 



+ (1 + Af»)(l + |u|) 2 *+V e , ro (u) - ^(«)| 2 d« 



1/6 



1 + u\ 



+ (1 + Al( M ))(l + |«|)^ +1 |< m (u) - ^( U )| 2 d«. 



Therefore, we find a constant A' > 0, depending only on R,f3, such that (19) holds for 

^ b fAl(u) + At n (u) 



A* :=A' 1 



-1/6 



l + M 
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+ (1 + Af„(u))(l + |u|) 2 "+V e>m (u) - ^ 8 (u)| a ) du 



1/2 



(23) 



Now, the conditions (19) imply that ip is indeed a Fourier multiplier on B e (b) and thus by Theorem 
4.8 and Corollary 4.13 by Girardi and Weis (2003) there is a universal constant C > such that 
for all 77 > and / G C s+/3+,) (R) 



T- 1 



f 



T- 1 



sC CA 



Ve,m{u) ' J C» 



Since the Fourier multiplier (1 + iu)P is an isomorphism from C S+,3 (]R) onto C S (R) (Triebel, 2010, 
Thru. 2.3.8), there is another universal constant C > such that the first assertion of the lemma 
follows: 



tp K {bu) 



< £&||/||c s +< 3 +" witri & '■= C'A$. 



To bound £ b , we apply Markov's inequality on from (23). The inequality by Rosenthal (1970) 
yields 



E 



,P/2 



(0, 



< 00 



for I = and p e N as well as I = 1 and p G {1, ... ,4}. Combined with the Cauchy-Schwarz 
inequality, we obtain 



P '^ n fc> m i/^Al |) 

P ^ (&)n K > CW/C 2 +1 Al) l) 



C /( TO i/2^+i A 1) . 
< c- 1 C ,2 (m6 2 ^ +2 A 1) E [^1 B .(6)] 

<-(m^ +2 Al)(l+ ((l + HJ^E^W+Aj.H] 



E 



< mfr 2 ^+ 2 A 1 ( 



which completes the proof. 



'-l/b 

(l + A^(«))(l + H)^ +1 K ro («)-^(«)| 

1 du 



dt/ 



1 r 1 / 6 \ 1 

,-/ (i + | M |) 2 ^ +1 d u ) <-, 
log&| 3 J- 1/b l + \u\ mj_ 1/b x I c 



(24) 
□ 



5.1.2. Proof of Proposition 2.2 

The following lemma establishes a bound for the bias term of the estimator for the distribution 
function. 



Lemma 5.2. Let Assumption 1 hold with £ = (a) +1, a > and f(» + q T ) £ C a ([— £, £], Then 
we have 

Db a+ \ 



sup 

f(»+q T )€C«a-CX],R) 



K b * f(x) dx- f(x) dx 



where D = (R/((a) + 1)1 + 2(- a - 1 )\\K(x)x a+1 \\ L i . 

Proof. Let F{x) := f(y) dy. Twofold application of Fubini's theorem yields 



K b * f(x) dx 



(x)fv(y) dy dx 



/. Dattner, M. Reiji and M. Trabs/Quantile estimation in deconvolution 

Kb(x)F(q T — x) dx, 



15 



(25) 



where Kb(x) := b 1 K(x/b),x E R. Therefore, the bias depends only locally on /. Note that 
F(m + q T ) 6 C Q+1 ([— C, CD by assumption. A Taylor expansion of F around q T yields for \bz\ < £ 



F(q T - bz) - F(q T ) = -bzF'{q T ) 



1 ] ((«> + !)! ' 



where ^ K ^ 1. Using the fact that / x k K(x) dx = for k = 1, (a) + 1 and the properties of 
the class, we obtain 



K(z)(F{q T -bz)-F(q T ))dz 



\<C/b 



(z)(-bz) {a)+1 - { — — >- - ^ dz 

[)[ ' ((«)+!)! 



b^ +1 R 
««>+!)! 
/ b a+1 R 



K b * f{x) - f{x)) dx 
K s 

K(z)\\F(q r -bz)-F{q T )\te 

|A(z)||z| (Q)+1 |^r +1_(<a>+1) + 2 
\Kiz)\\z\ a+1 dz, 



\z\>c/b 



\K h iz)\dz 



b^ 



and the statement follows. 



□ 



Proof of Proposition 2.2. We will show uniformly over / € C Q (i?, r, Q and / e € T>° iR, 7) for any 
& such that (n A m)b 2 P +1 —> 00 



if b (x)-f(x))dx 



Op[b 



yjinhm)^- 1 A 1) ^(nAm)(m6 2 ' 3 + 2 Al) 



The third term on the right-hand side is of smaller or of the same order than the second one 
if and only if (to6 1a2/3+2 ) _1 < 1. Hence, when a ^ 1/2 the asymptotically optimal choice b = 

i nAm )-U(^+^Vl/2) + l) yieldg 

' T (fb(x) - fix)) dx = P (in A TO )-(«+i)A 2 «+ 2 /3+i) v ( n A to)- 1 / 2 ) . 

00 

Step 1: As usual we decompose the error into a deterministic error term and a stochastic error 
term, writting ipx — F f, 



hix) - fix) dx 



A 



K b * fix) - fix)dx 



+ 



ip n ju)ipK(bu) 
Vcmiu) 



ip K ibu)<pxiu) ix)dx 



The bias is of order 0(b a+1 ) by Lemma 5.2. As discussed above, we decompose the stochastic error 
into a singular part and a continuous one using a smooth truncation function. Let a c G C°°(R) 
satisfy a c (x) = 1 for x ^ — 1 and a c {x) = for x ^ and define a s (a;) := l(_oo,o] l 21 ) ~ <2c0*0- Then 



</>e,m(«) 



a s (x)J" 1 ip K ibu)( Vn 7 - (.T + q T )d.T 
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+ / a c (x) T~ x \<p K (bu)( V? " ( " t) , -<px{uj)](x + q T )dx=:T s + T c . (26) 

The singular term T s will be treated in the next step while we bound the continuous, but not 
intcgrable term T c in Step 3. 

Step 2: Recalling the definition (18) of the event B e {b), let us denote its complement by B E (b) c . 
Lemma 5.1 shows that B e (b) c is asymptotically a null set. We obtain for any c > with Markov's 
inequality 



F T, > 



^nAm)^- 1 V 1) 



s (6)n{|T.| 



> 



(-^(nAm)^ 1 V 1)1 



V(nAm)(() 2 Hvi) 

o(l) 



| Ts | lB e (6) 



To bound E[|T s |]l Be r b \], we first note by Plancherel's identity 



2tt 
1 



\- [ Fa s {u)e- m ^ VK {bu)(^^- - Vx {u))du 

r/>_ fill \ 

d» 



,- / Ta s {u)e- iu ^ VK {bu)(^^-^ x {4 
Jr > 

27T J R (/9 £ (m) \tp eim {U) I 27T V 



(27) 



The first term T s ^ x corresponds to the error due to the unknown density / while T Si£ is dominated 
by error of the estimator tp £ , m - Since a s is of bounded variation and has compact support, there is 
some constant A s £ (0, oo) such that | T a s (u)\ ^ A s (l + |u|) _1 . We then obtain with Plancherel's 
identity 



n 



T a s (u)e 



< — IIA-lloo 

n 



<Pe(u) 



du 



^\\K\\Ufr\\oo J_ 
<i||tf|M||/y||=o/ 



V l J-q s ( M )| 2 
\<p e (u)\* 



L- 



1/6 



du 
1 



1/b (l + | u |)a|^(«)P 



■ du. 



Using the assumption \\f\\oo < R an d f e & ^(-R, 7), we get 



n\T s . 



< - 



1/6 i 1 

(i + H) 2 ^- 2 du< -^Uvi 

-1/6 



(28) 



To bound T Sj£ , we will use the following version of a lemma by Neumann (1997): By the definition 
(18) of B e (b) and applying (20) it holds 



E 



fe,m(u) 



11 



Beib) 



C 2] 



\<Ps,m{u) ~ fe{u)\" 



2E 



\<Pe,m(u) ~ <p £ {u)\ 



\fe{u)if e ,m{u)\ 



2E[|p e , m (u) -</?e(u)| 2 ] , 27nE[|^, rn ( u )-v ?e ( u )| 4 
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m\(p e (u)\ 2 

Now, we estimate with the Cauchy-Schwarz inequality 



(29) 



1/6 



-1/6 



Vn(w) 



du 



1/6 



-1/6 



<p E (u) 



(llf 
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< 211*11^ + 



|^„(u)-^(u)| 2 d ^ f 1/b 



-1/6 



\Ta s (u)\ : 



-1/6 



du. 



Applying again the Cauchy-Schwarz inequality, Fubini's theorem, the decay of J- a s and (29), we 
obtain 



E[|T,, 8 |1 B , (6) ] <v^||X|| il (||^||i 2 + 



1/6 



IVe(")| 5 



E 



-1/6 (1 + M) 2 L Ve,m(u) 



36||/f|| L iA s 



1/6 



\\<PX\ 



2 

L- 



l/b 



- 1 



1 



l B c {b) 



du 



1/2 



1/6 H<ArO)| 2 
1/2 



1/2 



L 1/b (i + |u|)»|^(«)P 

The assumptions ||/||oo < 1 and |y e (u)| £ (1 + M)"' 3 yield 



E[|T s , £ |l Se(6) ]< 1 



1 \ V2 



V 



du 



1 



(30) 



'5/3-1/2 



< 



-1/2 



The last estimate follows from the fact that the choice of b as stated in the theorem guarantees 
that ?i~ 1 6~ 2 ' 3 ~ 1 ->0asn->oo. The last inequality together with (28) and (27) imply the optimal 
order 

E[|T„|1 B . (6) ] < ((nAm)(&^- 1 Al))" V2 . 

Step 3: Let us define the empirical measures of (Yj) and (sk) as fiy.n := — V,"— i Syj and 
Me,m : — ^ SfcLi ^£fci respectively, where <5 X is the Dirac measure in i G 1. We can write 

T c = ( acix)?" 1 (ip n (u) - ips. m (u)<px(uj) (x + q T )dx 

Jus Lca, „ (u \ /J 



= J"" 



Ve,m(— W) V / J 

' (fen * fflo(-«) - ^e,m * / * «<:(—•)) (.Qt)- 



-<Pe t m(u) 

Applying Lemma 2.1, wc obtain on B £ (b) for any integer s > ft 

(p K (bu 



T, C 



* fen * O c (— •) - /i £ , ro * / * O c (— •) 



-<Pe,m(u) 

< <?6 fen * Ct c (-«) - Me,m * / * O c (-«) 



= ^6^ ||fen * ac\- m ) - Me.m * / * ( — •) I 



1=0 



Therefore, 



P 



(B e (b) n {|r c | 



> 



v / (nAm)( v /mfii , + 1 A 1) 



i a nl) 



(6) n > ( 



l) 1/2 }) 

c iioo \nAmJ J 



mh 2f}+2 A 



fen * a y c - fi em * 



1=0 



1/2 N 



Pi 
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By Lemma 2.1, more precisely estimate (24), the first probability is of the order 1/c. To bound P2, 
is suffices to show ||/iy. n * ai^ — \i e . m * / * a^P || = (9p((n A ?n)~ 1 / 2 ) for alii = 0, . . . , s. Denoting 
the density of Yj as fy = f * f s , we decompose 

\\f, Y>n * (o«(-.)) - * / * (oW(-.))L 

<|| M y,„ * (o«(-.)) - /y * (a«(-))L + II/* * (/ * ))) - Me,™ * (/ * (o«(-. 

oW(y-.)My,n(dy)-E[oW(yi-.)] 

E[(/ * o«)(ei -•)]-/(/* - ')^, m (dz) 



By construction all a£ , Z 5^ 1, have compact support and are bounded. Therefore, ||<ic ||z,i < 00 
and |j(a c * /)^'||li ^ Hac^l^H/IlL 1 < 00 and thus the functions a c (« — i) and a c * /(• — t) for all 
t £ M are of bounded variation. Since the set of functions with bounded variation is a Donskcr class 
(cf. Theorem 2.1 by Dudley (1992)), the two terms in the previous display converge in probability 
to a tight limit with -y/ri-rate and -y/m-rate, respectively. Consequently, 

V^A^\\[iY,n * (a®(-.)) - H s , m * f * (4°(-»))L = °f(V 

for all I = 0, . . . , s and P2 is arbitrary small for c large. □ 

For the adaptive estimator we will later need the following uniform version of Proposition 2.2. 

Corollary 5.3. Suppose Assumption 1 holds with I = (a) + 1 and let B be a finite set of bandwidths 
with b\ = mini3 such that mb\^ 1+1 — > 00. For a sequence of critical values (Sb)b£B satisfying 
8b > 3Db a+1 and for any sequence (x n ) n with x n — > 00 arbitrary slowly we obtain uniformly in 
C a (R,r,Q andV f3 (R,~f) 



P 3b eB 



(f b (x) - f(x)) dx 



= °(E(j-((-AH(^- 1 Al))- 1/2 + i 



bee 



5% n(mb 2 P+ 2 A 1) 



o(l). 



In particular, if \B\ < logn, max(, e g b — > and min;, e g(n A m)b 2l3+1 — > 00 



P[ sup 



h{x) - f(x) dx 



> 6 



for all 5>0. 



J-00 

Proof. With the notation of the proof of Proposition 2.2 and applying Lemma 5.2, we obtain 



(h(x)-f(x))dx 



(K b * f{x) - /(*)) dx 



+ T r < Db 



a+l 



T s + T c , 



where T s and T c are the stochastic errors of the singular part and of the continuous part, respec- 
tively, as defined in (26). Since both terms depend on b let us write T s (b) and T c (b). By definition 
61 ^ b implies B £ (bi) C B £ (b). Then, Step 2 in the previous proof shows 

P(3b eB:T s > S b /3) ^P(3b G B : T s (b) > S b /'S} n B e {h)) + P(B £ (foi) c ) 

^(E P (i T ^ 6 ) > V3}DB e (6i))) +o(l) 



beB 



(E« 5 r 1 E[|T s (6)|]l Be(6l) ])+ (l) 



tee 



-1/2 



bee 



o(l). 
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Following Step 3 in the previous proof, we obtain with the random operator norm £ b , for some 
integer s > (3 and for a diverging sequence (x n ) 

P(3b €B:T C > S b /3) ^P({3b e B : £ b > W /2 /(3K) 1/2 )} n B £ (h)) + P(B £ (&i) c ) 

n 



+ ^({Ell^ n -«-Me, m */*ai')|| oo >(^) / }) 
< ( P (i £ b > <W /2 /(3(^) 1/2 )} n B E (bi))) + o(l) 



< 



beB 

bee 



(5 2 n(m& 2 £+ 2 A 1) 



+ o(l), 



where we have used (24) in the last estimate. 



□ 



5.1.3. Proof of Proposition 2.3 

Without loss of generality we set q T = 0. Recall definition (10) of the pseudo-estimator f b which 
knows the error distribution. We estimate 

sup \f b (x) - f(x)\ < sup \f b (x) - f(x)\ + \\f b - / 6 | |oo 

se(-CC) xe(-c,C) 

ip K (bu)tp n {u) t <p s (u) 



< sup |/ 6 (x) - + 
<se(-C,0 



V3 E ( U ) Vy> e , m (u) 



Li 



The analysis of the first term is very classical. However, the authors are not aware of any reference 
in the given setup. Both terms will be treated separately in the following two steps. All estimates 
will be uniform in / G C a (R,r,() and f e G V^(R,j). 

Step 1: Let b € (0,1). We will show that there are constants d, D > such that for any 
t > d{b a + {nb 2 ^ 1 )- 2 ) 



P( sup \f b (x) - f(x)\ > t) ^ 2exp (21ogn - Dnb {2fi+1 \t A t 2 ) 



'ze(-CC) 

Then the result follows by choosing t ~ b a 
k = 1, . . . , [2(n 2 \ =: M as well as 



(31) 



log n \ 1/2 



Let us define x^ := —( + kn 2 for 



vk(H e i UYj 



(x) — E 



<PK(bu) ciuY . 



(X) 



-K b * J 7-1 



H[-b-i,b-i]('«) — (x) - -K" b * /(x), x e 



Therefore, / 6 (x) - E[f b (x)} = i £)" =1 Xj'fc) and thus 



sup |/ 6 (x) - /(x)| < sup |E[/ b (x)] - f(x)\ + sup |/ fe (x) -E[/ 6 (x) 

\x\<( \x\<C |a=|<C 



I 1 " 

s$ sup |E[/ 6 (x)] - /(x)| + sup min - V(xj(x)-Xj(a;fc)) 

|x|<C |x|<C fc=1 "" ,A * 1 n 

I 1 " 



:£ + V 1 + V 2 . 
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The bias term B can be bounded as in the classical density estimation setup (cf. also Fan, 1991, 
Thm. 1 and 2), noting that the constant does not depend on x <G (— C,C)- Hence, \B\ < b a . Using 
a continuity argument and the properties of f s € D^(R, 7), the term V\ can be bounded by 



\Vi\< 



1 11 1 



00 r 



e iuYj 



f) 



< -L\\K'\\ Ll (\\l hb - Kb -^\\ L1 + ll/IU) < n-H-^+V < (nb^)- 2 - 

Therefore, \B + Vi j ^ D^b* + (nb 2l3+1 )~ 2 ) for some constant D x > 0. We obtain for all t > 
d{b a + (nb 2l3+1 )~ 2 ) with d := 2D 1 



1 71 t ^ 1 71 

^(sup \f b (x)-f(x)\ >t) <p( fc max Af |-g Xj -(^)| > 5) ^E P (|-E^) 



fc=i 



>2 



Finally, we will apply Bernstein's inequality. To this end we estimate 

m&x\ X j(x k )\ <2||/v b || L1 ||]l [ _ 6 - ljb - 1] ^- 1 || i i < D 2 b- (fj+1 \ 

with some constant D2 > 0. Using Plancherel's identity, the variance can be estimated by 



F- 1 



^2~J^ 



ip K {~bu) 2 



L- 



(x k )\ - (K h * f) 2 {xk) 
<D 3 b~W +1 \ 



(p e (-u) 

for some D3 > 0. Then Bernstein's inequality yields 

M n 

P( sup \f b (x)-f(x)\ >t) <Y>(| Y,Xd(xk)\ >nt/2) 



fc=l J=l 

sC 2M exp 



2 exp ( log M 



8(£> 3 + D 2 t/3) 
^ 2 exp ( 2\ogn - Dnb {2f3+1) (t At 2 )), 



with some constant D > 0. 

Step 2: By the Cauchy-Schwarz inequality we have 

ip K (bu)tp n (u) ( <p £ (u) 



E 
<( E 



tp B (u) \if E . m (u) 



1 



L 1 



u 



B E (6) 



l[-l/6,l/6] («) 



2 - 








L 2 - 


E 




( 



l)]l[-i/&,i/b](w] 



L 2 



< 



+ , *4)_|iMa d „)'' 2 )( r «ri j&feL . 1 

W-l/6 l^eWr 7 JK J-l/b L Ve,m(") 



ll^ll 



L 2 



'-1/6 

1 \V2 



1/2 



df/ 



1/2 



^2/3+1 



1 \V2 



^2/3 + 1 



where we have used (29) for the last step. Therefore, the additional error due to the unknown 
error distribution satisfies for any 5 > by Markov's inequality and by Lemma 5.1 



ip K (bu)tp n (u) / ip £ (u 



- 1 



L 1 



>S) - E 



(p K (bu)tp n (u) ( (f e (u) 



rjPeW_ 1 \ 

\u) J 



fe{u) \ip E , m {u) 

+ P( inf \<p e , m (u)\ <m~ 1 /2 
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and thus — /b||oo = Op((mb 2l3+1 )~ 1 / 2 ). Note that the second term does not depend on 6 and 
thus o(l) is sufficient. □ 



5.1.4- Proof of Theorem 2.4 

We start with a lemma that establishes consistency of the quantile estimator and then prove the 
theorem. To apply this lemma also for the adaptive result, we prove convergence uniformly over a 
set of bandwidths. 

Lemma 5.4. Grant Assumption 1 with I = 1. Let B be a set of bandwidths satisfying \B\ < 
log n, rnaxS — > and min(, e g(log n) 2 /((n A m)b 2l3+1 ) — > 0. TTierc 

sup sup P(sup |g T) b — q r | > <5) — > /or all 6 > 0. 

f&C"{R,r,i)heDP{R,i) beB 

Proof. We follow the general strategy of the proof of Theorem 5.7 by van der Vaart (1998) 
in the classical M-estimation setting. First, we show that / satisfies the uniqueness condition 
inf„.in- 



\M'(rf)\ > for any 6 > and with M'(t)) = j2 oa f(x)dx - r, which is the 
deterministic counterpart of M b defined in (7). By the Holder regularity M"(r\) = f{rj) ^ 



17: 1^7— I ><5 



f(qr) ~ \f(qr) - f(v)\ > r - R\q T - > r/2 for \q T - r)\ (^) 

generality we can assume 6 ^ (jp) 1Va 



r \1Vq 1 



Without loss of 

otherwise consider the minimum o"A(^) 1Va .Recall 



that the true r-quantile q T is given by the root of M' and that M' is monotone increasing. Hence, 
we obtain 



inf \M'(t))\= inf \M' (q T - rj) - M' (q T )\ ^ 5 inf M"(r?) ^ 



8r 



vAv—QtI^s 



?)G{-5,5} 



Recall that q T>b is given as the root of the estimating equation (7) on the interval [—U n , U n ] for 
U n < logn. Therefore, 

P(sup\q T . b -q T \>S) ^P(su P \M'(q T . b )\^Sr/2) 
beB beB 

= P(sup \M'(q T . b ) - Mi{q T , b )\ > Sr/2) 

beB 



^P(sup sup \M'{ri)-M' b {ri)\^8r/2) 

beB ne[-U n! U n ] 



PI sup sup 

V beB ne[-u n ,u„ 



(f b (x)-f(x))dx >6r/2), 



(33) 



where we have used M b (q T ^ b ) = 0. Hence, it remains to show uniform consistency of fb{x) Ax 
Write, 



(f b (x)-f(x))dx 



< / (K b *f(x)-f(x))dx 



(f b (x)-K b *f(x))dx 



\K b *F(r,)-F(r,)\ 



(f b (x) - K b * f(x))dx 



We have \K b *F{rj)-F{rj)\ = \J K b (z)(F(7]-z)-F(r]))dz\ < 6||/||oo||«-K'(»)IUi by the boundedness 
of /. Further note for n e [—U n , U n ] 



(fb(x) - K b * f{x))dx 
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{f b (x)-K b *f(x))dx 



q T Vn 



(f b {x) - K b * f(x))dx 



g x Ar; 



(f b (x)-K b *f(x))dx / (f b (x)-K b * f(x)) 2 dx 



1/2 



where we have used the Cauchy-Schwarz inequality for the last step. Hence, together with (33) we 
obtain for all S > 6\\f\\ oc \\zK(z)\\ L i/r sup 6GB b 



P(sup \q T>b - q T \ > 5) < P( sup sup / (f b (x) - f(x))dx ^ Sr/2 

beB K beB ne[-U n ,U n ] J-oo 

It _ 

(fb(x) - Kb * f(x)) dx > 5r/6 



«S P( sup 

■ 6eB 



+ P( sup / - Kb * f{x)j 2 dx > <5 2 r 2 /(72£/„) 

■ bee 



Corollary 5.3 shows under the conditions on B that 



P{ sup / fb(x)-K 

V beB J-oo 



6 */(a;)dx >fr/3 -> 



Hence, it remains to show 



P 



(sup / (/ 6 (z) - K b * f(x)) 2 dx > 6 2 r 2 /(72U n )) 

v beB -'in 



-> 0. 



(34) 



On the event B e (b), (34) follows basically from the work of Neumann (1997). More precisely, 
Plancherel's equality, (29) and the Cauchy-Schwarz inequality yield for any b € B 



E 



(f b (x)-K b *f(x)) 2 dxl Bs(b) 

<Pn{u) (Py{u) 2 



I \ m (bu)\ 2 E 



1 



B e (b) 



du 



< 



< 



< 



1/6 



E 



|y?»(u) - v?y(m)P 

|^ e ,m(w)| 2 



-lib 



BAb) 



1 



1/6 

1/6 



z 



(1 + m\<f) e , m (u) - (fe(u)\ 2 ) 



(p E ,m(u) <Pe(u) 

\vy(u)\ 2 



m\ip e (u)\ l 



du 



du 



1/6 \<Ps{u)\ 
1/6 



I_(7e f|p n (u) - <py(u)\*\ E [2 + 2m 2 |<p e , m (u) - ^ e ( W )| 4 l + l^Ml 
lu)r V V L J L J / m 



du 



1/6 



|¥> e (u)| (n +m l )du 



< 



(n A m)6 2 ' 3 



+1 



Noting that S e (minB) C B e (b) and applying Lemma 5.1, (34) follows then from Markov's in- 
equality 



P sup / (/ t (x) - K b * f(x)Y dx > S 2 r z /(72U n ) 

■ beB 



<U n 5- 2 VE f / (/ b (x) - K b * f(x)) 2 dxl 

urn L "'K 

0(1). 



Be(minB) 



< 



bee 
(logn) 2 



P((P £ (min2?)) c 



'i 2 (nAm)F+ 1 



□ 
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Proof of Theorem 2.4- By a Taylor expansion argument we have 

K(q T ) _ fl^ fb(x) dx-r _ J^iMx) - f(x)) dx 



Qr.b - q T = 



Ml'(q*) 2f b (q* T ) 2f b (q*) 



for some intermediate point q* between q T and q T _ b . By Proposition 2.2, the numerator in the above 
display is of order Op(?r' a+I '/' 2a+2 ' i+1 ') for the optimal bandwidth b*. For the denominator we 
will show fb{q*) = /(?t) + °p(l) which completes the proof. Since /(• + q T ) € C Q ([— (, Q, R), we 
obtain \ f{x + q T ) - f(q T )\ < t/2 for all |x| ^ (^i?) 1 ^" 1 A ( =: S for any t > 0. Therefore, 

P(\fb(qr.b) - f(q T )\ > t) ^P{ sup \J b (x + q T ) - f(q T )\ > t) + P(\q T . b - q T \ > S) 

x<£[-S,S] 

^P( sup \J b {x + q T )- f{x + q T )\>t/2)+P{\q T , b -q T \> 8). (35) 

Checking that the bandwidth satisfies b — > and log(n) / (nb 2l3+1 ) — > for n — s- oo , the first term on 
the right hand side above converges to zero by the uniform consistency proved in Proposition 2.3. 
The second one vanishes by Lemma 5.4. □ 

5.2. Proofs for Section 3 

We start with showing that the construction of B n from Lemma 3.1 satisfies Assumption 2. 
5.2.1. Proof of Lemma 3.1 

The deterministic counterpart of b m i„, which is defined in (12), is given by 

r /loen\ 1 / 2 f l / b "i 

b min := min |& G A n : 2 < (^-J J / du < 4 |- 

Noting that for f £ G V f3 (R,-/) 

'l0Kn\V2 /-Vftmin / logn ,1/2 



4 ^ . 

-l/6min ' ' ~ 

we obtain nb 2 ^^ 2 — > oo and thus it is sufficient to prove the following 

inf inf P(b min ^ b min ^ b*) ->• 1 as n ->• oo. (36) 

/£C«(K,r,C)/se^(K,7) 

for the optimal bandwidth 6* = ?7 ,- 1 / 2q + 2 (^ v1 / 2 )+ 1 f or convenience we define 

V " y J-l/b We{u)\ \ U J J_ 1/b \<p £ , m {u)\ 

Assume b m i n < b m i n , then monotonicity implies I n (b m i n ) ^ I n ipmin) *S 1- Combining with 
I n (b m in) > 2, we obtain I n {b min ) - I n {b min ) ^ 1. Hence, 

P{f*min ^ ^rnm) ^ -^^(l-^n^mm) -^77(^777777)! ^ -0 ■ (^7) 

On the other hand, if 6* < b min , we get ?„(&*) ^ I n (b mm ) > 1/2. Since /„(&*) < (^P+2) 1/2 
converges to zero, I n (b*) ^1/4 for n large enough. Thus, 

P(b mm > b*) < P(|I„(6*) - 7„(6*)| > 1/4). (38) 
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To show that the right-hand sides of (37) and (38) converge to zero, we first apply the Cauchy- 
Schwarz inequality 



™ r/Mi 2 . 10 ^ f 1/b du f 1/b 

,(b)-I n (b)\ ^ / | — 1-^2 

n 7-1/6 WeWl 2 7-1/6 



< 



logn 



1/6 



-1/6 



tp s (u) 



2 

du. 



du 



By Lemma 5.1 we can consider the event B e (b) only and thus Markov's inequality and (29) yield 



p({\i n (b) - / n (6)| > |} n B e (b)) < JgL £ /6 



-1/6 



y> e (u) 



<^£,m(u) 



ifi £ (6) 



du < 



logn 



which converges to zero for b £ {b m i n , b*}. Therefore, (36) holds true. □ 
Before we can prove Theorem 3.2, some preparations are needed. By Lemma 5.2 there is a 
constant D > such that the bias can be bounded by Bb := Db a+1 . By the error representation 
(8) we have for any b £ B 



|?t,6 - Qt\ = 



I 9 _ r 00 (f b (x)-f(x))dx 



2/6(9* 



gh + |Vfo + V&, E + H,. 
2|/6(g*)| 



with some q* £ [(q T A q T .b), (<Zt V <7 T ,&)] and where the stochastic error is decomposed in 
1 " 

V b ,x:=-Y,&( b )- E Mb)}) with 

-cp K (bu)e iuY * 



(39) 



(40) 



V6, e 
V6,c 



a s (x) J" 



— oo 




L <^ e (ii) 
(p K (bu)<p n (u) ( (fi £ (u) 



(^rr- 1 ) 



</?n(«) 



(x + q T ) dx, 
(x + q T ) dx 



a c {x)F 1 ifK(bu)( — 7 r (x + q r )d.T 

L V(p ejI „(u) 



(41) 
(42) 



In view of the analysis in Section 5.1.2, the part of the stochastic error which is due to the 
continuous part a c will be negligible. Hence, we concentrate on Vb.x and Vb, £ . By independence of 
{£j{b))j, we obtain 



1 



1 



Var(y h ,jf) <-E[^-(6) 2 ] = -E / a s (x) T' 1 



<p K (buy uY n ^ 
(x + q T j dx 



'b,X- 



(43) 



To determine the variance of Vb )E , it will be helpful again to restrict to the event B e (b), defined 
in (18). We apply Plancherel's identity and the Cauchy-Schwarz inequality to separate Yi and 
from each other since they are not necessarily independent: 



:[\v Ke \i BAb) ] ~e 



Fa 



1/ 



<- 



2tt 



\<p K (bu)\ 



x ( / WK{bu)\\F a s (-u)\ 
v 7r 

( / \VK{bu)\ 

^ .ID 



du 



1/2 

ip e {u) 



1/2 
du) 11 



B e (b) 



fe{u) 



Let us define 
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X SUp \(fi e , m (u) - ip E (u)\[ / |^jf(6u)| 
|«|<l/6 



dull 



B E (b) 
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(44) 



1 -1/2 
Z7T 



(45) 



with 



°~b,c,2 ■= E 



\(f K {bu)\ 
\<p K (bu)\ 



<p e (u) 
Fa s {-u) 



2 \l/2 

du 



2 s 1/2 

du] 11 



B e (b) 



With the bounds (Jb,x and a b ^ £ at hand, we obtain the following concentration results. 

Lemma 5.5. Let B be a set satisfying \B\ < logn, (log log n)/nb\ — > for b\ = minf? as well as 
| log 6i | < logn. Then we obtain uniformly over f G C a (R,r,(^) and f e £ T>@(R,~f) for any 6 > 0: 

(i) p(3beB: \V b<x \ > (1 + 8)y/loglogn(V2<T b , x + o{n- 1 ' 2 (b- f)+1 ' 2 V 1)))) 0. 
(it) p(3b£B:\ V btE | ^ 6 (log n) 3 a M ) -> 0. 



(Hi) Assuming further mb± 2 > 1, 

Pf36 G 6 : \ V b ,c\ > (log7i) 3 / 2 n- 1 / 2 (6-' 3+1 / 2 V 1)) -> 0. 



Proof, (i) Using Bernstein's inequality, we start with proving uniformly over / € C Q (i?,r, £) and 
/ e G 2? /3 (i?,7) for any positive «„ = o(nb) 



P(\V b ,x\ > ^(V2a b , x + o{n- l ' 2 {b-^ +1 l 2 V 1)))) < 2 e - K " 



(46) 



The assertion follows then from choosing k„ = (1 + S) 2 log log n for some <5 > and \B\ < logn. 
Plancherel's identity yields the deterministic bound 



< 



.Fa g (u) 

-1/6 <£>e( u ) 
1/6 



du 



,1/6 

/ |J"a s (u) 

J-l/6 



ip K {bu) 



du 



1 



-1/6 (l + \u\)\ Ve { U )\ 



du < 



Hence, \£j(b) — E[£j(6)]| ^ Cb " for some constant C > 0. Since the variance is bounded by (43), 
Bernstein's inequality (e.g. Massart (2007), Prop. 2.9) yields 



>(\Vb. 



■\ > \/ 2<T t,X K r, 



) < 2e~ K " 



Therefore, (46) follows from ^(r^)" 1 < (n(6 2 ' 3 - 1 A l))- l / 2 (n n / (nb)) 1/2 . 
(ii) Using an estimate as in (44), we obtain 



\v b , e \ < — ( / \m(bu)\ 



< ^( / \<PK(bu)\ 



Vn{u) 2 



du 



\<p K (bu)\ 



du 



1/2 
1/2 



\<p K {bu)\\Fa s (-u)\' 



ip s (u) ~ fe,m{u) 



du 



1/2 



Fa s {—u) 



2 N l/2 

du 



SUp \<p s (u) - ip E ,m(u)\ 
u|^l/6 



1 



7^V6, e ,lV6,e,2 SUp I if e (it) — tp Em (u) I . 
|«|<l/6 
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Hence for any c > 

-P({|Vb, e | > S(logn) 3 a b>s } HB e (by 

< P(\V b , £ ,i\ > (logn) 1 ^,,,!) + P({|V r 6>El a| > (logn) 1+c a b , £i2 } D B e (b)) 

+ P( sup |^H-^e.mH| > Silogn) 1 - 2 ^-^ 2 ) =:P M +P M + P M . 

v |u|^l/b y 

The first two probabilities can be bounded by Markov's inequality: 

P M ^ (logn)- 1 -^^^^,,!] = (logn)- 1 - c , 

P b ,2 < (logn)- 1 -^^^,,^^^)] = (logTi)- 1 -^ 

For Pb j3 we will apply the following version of Talagrand's inequality (cf. Massart (2007), (5.50)): 
Let T be a countable index and for all t € T let Zij, . . . , Z n ,t be an i.i.d. sample of centered, 
complex valued random variables satisfying || ^fc,* || oo ^ b, for all t £ T,k = l,...,n, as well as 
sup tgT Var(^J! =1 Zk y t) ^ v < oo. Then for all k > 



p(snp\Y,Z k ,t ^4E[sup|^]Z fc , t 



■ tGT 



k=l 



tGT 



k=l 



/ 2vn+ -bn) sC 2e _K . 



(47) 



Choosing the rational numbers T — Qfl[— -|, |] and t := e ltefc — y> e {t), Talagrand's inequality 
applies with 6 = 2 and v — n. As in (17) we use Theorem 4.1 by Neumann and Reifi (2009) to 
obtain for any rj £ (0, 1/2) 



Sup \<Pe, m (t)-<P&)\ <|l0gfo| 1/2+ ". 
\u\^l/b J 



m 1/2 E 

Therefore on the assumptions K~ 1 (log n) 1+2v and n n /m 
4E 



Sup \<p e , m (u) - <Pe(u) 

|«|<i/6,«eQ 



and thus continuity of ip et m and (47) yield 



— + ^*n = J— (V2 + 0(l)) 

to oto V ?n 



A,8 = P 



sup 

|«|<i/6,ue 



- ¥>e(«)| > (V^ + o(l)) yjK n /rnJ 2e 



With K n = |(logn) 2 " 4c for c < 1/4 - rj/2 we obtain P 3 < 2n-' 5 / 2 . 
Using bi = min£>, | log£?| < logn and Lemma 5.1, we finally get 

P(sup \V b>e \ > (V2 + <5)(logn)V e ) < £ (P M + P 6>2 + P M ) + P(P e (&i) c ) = o(l). 



■ 6GB 



5G8 



(mj Corollary 5.3 shows for 5f, > and for any sequence (x n ) n that tends to infinity 



p{3beB: \V b . c \ >S b )<J2 



bEB 



8 2 n{mb 2 ^ 2 A 1) 



o(l). 



So, choosing S b = (logn) 3 / 2 n 1 / 2 {b > 3 + 1 / 2 V 1) and x n = o((logn) 1/2 ) yields 
Pf36e6: \V b J > (Iogn) 3 / a n- 1 / a (6-/'+Va V i) > 



< 



E 

6GB 



(logn) 3 (TO6( 2 ^ A1 )+ 2 A 1) 



0(1) 



< 



(log?i) 2 (TO6( 2 ' 3A1 )+ 2 Al) 



+ o(l) = o(l). 



□ 
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For the denominator in the error representation (39) we need uniform consistency. A uniform 
result on the error \q T ,b ~ ?t| follows immediately. 

Lemma 5.6. Let B be a finite set satisfying \B\ < logrt, sup bee b\og(n) — > as well as 
sup fce g(log?i) 2 /(n6 2 ' 3+1 ) — > 0. Then we obtain for n — > oo and rj £ (0, 1) 



sup sup Pi sup sup |/&(<?*) - /(g r )l > VfiQr, 

f£C<*(R,rX) fe€T>P(R,-y) K beB q% S [q T Ag T , b ,q T Vg T , b ] 



0. 



(48) 



Moreover, supposing min^gg no ( 2 P M )+ 2 > i j we obtain uniformly in f £ C a (R,r,() and f e £ 
T>P(R,j) for any sequence of critical values (6b)beB satisfying infg 6 b — > oo 



P(3b e B : \q Tib - q T \ > 6 b (3Db a+1 + n- 1 ' 2 (b- p+1 ' 3 V 1))) < ]T j + o(l). 



(49) 



beB 



Proof. Since f(q T ) ^ r and / e C a {[q T — (,q T + Q,R), decomposition (35) implies with n = 

^2R> 



(50) 



Pi sup sup 1/6(9*) - f(Qr)\ > r]f{q T ) 

V beB q*e[q T Aq Ttb ,q T Vq Ttb ] 

^P(sup sup \f b (x + q T ) - f(x + q T )\ > nr/2) + P(sup|$ T , b - q b \ > k). 

beB x£[-k,k] beB 

Using b\ = vaxsxB, the first probability can be bounded by 

Y, P ({ SU P \fb(x + q T )- f(x + q T )\>Vr/2}nB e (b 1 fj +P(B £ (b 1 ) c ) 



beB 



<lognsupP({ sup \f b (x + q T )-f(x + q T )\>T 1 r/2\nB e (b 1 ))+o(l)=o(l), 
beB K ^xe[-K,K] ' ' 

since for all b the probability in the last line converges faster to zero than 1 / log n owing to the 
concentration inequalities (31) and (32) and the conditions on b. To estimate the second term in 
(50), we apply Lemma 5.4. Therefore, the conditions 61og(n) — > and (log n) 2 /(nb 2 ^ +1 ) — > yield 
the first assertion. 

The estimate (49) follows from the error decomposition (8), (48) and Corollary 5.3 with x n = 
o(inf e 5 b ) 



<P[3beB 



P(3b e B : \q T ,b -qr\> 6 b (3Db a+1 + »-V2( 6 -/M-l/2 v ^ 

T f b (x) - fix) dx > \f{q T )8b(ZDb a+1 + n- x ' 2 {b-^ +1 ' 2 V 1))) 
\fb{q*)-f{qr)\>\f{q T f l 



+ Pi sup sup 

^ beB g;e[g T Ag Ti6 ,g T V§V i6 ] 

< V (- + — — 

~^\8 h S 2 mb l ^+ 2 A 



beB 



□ 



beB 



The variances o b .x and crf, je , defined in (43) and (45), arc of course not available in practice. 
Instead, they can be estimated by 



r 2 -± 

h ' x ~n 2 



E 

3=1 



a s (x) T~ 



(p K (bu)e 



iliYi 



with 



^ 2 

a b,e,l 



1/6 



\ip K {bu)\ 



-1/6 



Ve,m(u) 
<Pn(u) 2 



\ 2 

(s + q T>b ) dx) , o~ 



2 

b,e 



1 -1/2—2 ~2 
2^ °b,e,l°M,2 



du > °6, e ,2 



1/b . „ jPa s (u)| 2 , 

-1/6 IVe,mr 



The following two lemmas show that these estimators are indeed reasonable. 
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Lemma 5.7. Let B be a finite set satisfying \B\ < logn, max^g b a logn — > as weZZ as 
miiibge nb 2 P +2 — > oo. TTien we obtain for all r\ > as n — > oo 



sup sup p(36eS: |3F 6 , X - <t 6 , x | > ^(rW 2 V 1)) 



0. 



Proof. Note that 



^ 2 

a b,X 



3=1 i =1 3— i 
2 A . „ 2 A 2 



(51) 



3=1 



i=i 



i=i 



where we have defined 









fci (6) 


= / a s (x] 


T 




J — oo 












= / a s (x) 


T 




J — OO 










0.3(6) 


= / a s (x] 

J —OO 


T 



(p K (bu)e iuYj 
ip K (bu)e iuY * 



<Pe,m(u) <Pe(u) 
(x + q T )dx, 



f\u)/ J 



<Ar( u ) 



(x) dx. 



We will first study these three terms separately. Applying Planchcrcl's identity, the Cauchy- 
Schwarz inequality, the Neumann type bound (29) as well as \Ta s (u)\ ^ A s (l + |m|) _1 and the 
decay of tp s , we obtain 



[|0,i(6)| 2 l BeW ] ^ J 
E[I0,2(6)| 2 ]=E 



9 f l / b \Ta s {u)\ 2 f 1 /" \<p K {bu)\ 
du ' 



l/b \<Pe(u)\ 

1 

WKW^AUfrWooR 2 f 1/b 



du < 



1 



1/b m\(p e (u)\ 2 ~ (b 2 ?- 1 A l)m6 2 / 3 + 1 : 



jr as{u)e -iu qT mM e iuY jdu 

fe{U) 



4tt 2 



(l + \u\) 2 ^ 2 du=:S l 



l/b 



(52) 



(53) 



as well as the deterministic bound 



I0, 2 (6)| 2 = — / Ta a {u)e 



¥>i<(bu 
ip e (u) 



■e mY i du 



\\k\\IA r 1/b 



4tt 2 



(l + \u\) 2f} du=:d 2 . (54) 



l/b 



Hence, Var[^ 2 (6) 2 ] «S E[^. 2 (6) 4 ] < d 2 b Sl. and |£f )2 (£>) - E[£? 2 (6)]| «S 2d 2 , so that an application of 
Bernstein's inequality yields for any b > and z > 



2S 2 d 2 + \d\z 



3=1 

Setting z = Si and noting Sf < (6~ 2/3+1 V l),d 2 < 6~ 2/3 , we see that 

P (|^E(0-2W-E[0 2 , 2 (6)]) >S 2 ) ^2exp(-^J) S =:2exp(-Cn6 2 ^ 1 



3=1 



(55) 



for some constant C. The right-hand side of (55) tends to zero with polynomial rate since nb 2/3M > 
logn. 
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Finally, we use suppa s C [—1, 0] to write £^3 as 



£j',3(&) = / { a s(x - q T ,b) - a s (x - g T )) T 

'(5V,bAg T )-l 

The Cauchy-Schwarz inequality and Plancherel's identity yield 



ip K {bu)e luY i 



(x) dx 



< sup \a' s (t)\\q T . b - q T \ 
te(-l,o) 



ip K (bu)e iuY i 



(x) dx. 



\^ 3 (b)\ 2 <||oil ( _i,o)||2o|«r,6 - 9r| 2 (l + |5r,6 - 9r|) 

K%-i,o)|| a 



27T 



L kr,6 - g T | (1 + |g-r,6 - <?t|) 



(9T,bAg T )-l 
ip K {bu) 2 



du 



ip K (b 



u)e iuY i 



dx 



< 



\Qr,b - 9r| 2 (l + \q T ,b ~ qr\)b 



-2/3-1 



By Lemma 5.4 sup bgB \q T .b — <fr\ = °p(1)- Applying (49), we conclude for some constant C > 0, 
for o" 6 = (^+(1/2-/3)+ + n -i/2 6 -^-i/2^-i and for any v > q 

P(3be6:|^)|>#^ +1 / 2 V1)) 
^p(a6 G B : \q T , b - q T \ > V Cb U3M ^ +1 ^ + o(l) 
^p(a g 6 : \q T , b - q T \ > ijCSbib^ 1 + n'^-m/a v 1))) + o(l) 

<(Ewi+»(i) 



(56) 



b68 

<sup6 Q log n + sup }?f " + o(l) =o(l). 



Combining the variance bounds (52), (53) and (56), we apply Markov's inequality, the Cauchy- 
Schwarz inequality and the concentration result (55) on the decomposition (51) to obtain 



sup 

tee 



nib 2 ^ 1 Al)\a 2 b 



x - a b.x I J = SU P 



V n 



J2ttU b )-nt;Ub)]))+o P (i) = o P (i) 



3=1 



□ 



Lemma 5.8. Let B be a finite set satisfying \B\ < logn as well as sup bgg l/(nb 2 " +1 ) — > 0. Then 
we obtain uniformly over f G C a (R, r, £) and f s G D"{R, 7) /or a?/ ?/ > as n — > 00 

p(a6 G 6 : \a b , E - a b , £ \ > J7(Iogn)m-V a (6-/'+V2 v 1)) -> 0. 

Proof. We start by showing for 61 = min 6 that 

<Pe(u) 



sup 

|«|<l/6i 



fe,m(u) 



= 1 + P (1). 



(57) 



To this end, recall «;(«) = (log(e + \u\)) : / 2 for some 77 G (0,1/2). Markov's inequality, 
Lemma 5.1 and Theorem 4.1 by Neumann and Reifi (2009) yield for any 6 > 



P( sup 



P Eim («) 



> (5) <p( sup m 1 / 2 !^^) - (p e , m (M)\ > ^1 log foil") 



|u|^l/6i 



P inf I^HKm-^llog&rl 

V luj^l/b! 
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^log&iD^E sup m^lvJsCu) - <p e , m (u)\ +o(l) 

L |«|<1/&1 J 

1 



■(S|log6i|u;(l/6i) 
=o(l), 



2 



sup m 1/2 w(u)\ip e (u) - (p s ,m(u)\ + o(l) 



which is (57). Note that [ — l/&i, l/i>i] is the maximal interval for all b <G B and thus (57) holds 
uniformly in B. 

Now, we consider ot, )B i. The uniform consistency (57) implies 



5 b 2 £a = (l + 0p (l)) / 



<^n(u) 



<Pe(w) 



du. 



Chcbyshcv's inequality yields for all 77 > 



P(sup|( / l^(Hl|g||pd U 



bee 1 v Jr 



1/2 



E 



1/2 



> r) log 77 



^lognT^Ef f |^(^lH-ffid< 



<(?? 2 log 77.) 



6eB 

_ x r 1 ^ 1 E[|*> n («)l ! 



1/6! \<Pe(u)\< 



du < (77 2 log 77) 1 , 



where the last estimate follows from E[|<y9„(?i)| 2 ] < |< / 9y(M)| 2 +E[|(/3 rl (7i) — ipy (u)\ 2 ] < |<^y (it)| 2 + l/77, 
/ e G T>P(R, 7), ll/Hoo 1 an d 77& 2/3+1 — > 00. Hence, we obtain uniformly in £? 



<?6,e,l = (1 + Op(l))((7 b , £i i + Op(logTl)) = C7-6 )ei i + op (log n). 

Concerning &b,e,2i we write with use of (57) 



(58) 



^ 2 

a b,e,2 



1/6 



-1/6 



|J"a s (M)| : 

|ye,m(7i)| : 



1/6 

du = (1 + o p (l)) / |<pif(6it 
-1/6 



l^sHI 2 



du. 



Moreover, the triangle inequality for the L 2 -norm and Lemma 5.1, applied on B s [b\) which is 
defined in (18), yield 

1/6 . „ ,.| Fa s {u)\ 2 , \ V2 

\<p K {bu) y— d^j -o- M , 5 



1//, 



IVe(«)| 5 



<2 



E 



-1/6 



1/2 



^2E 



<- 



2P((B e (6 1 )) 
1/6 
-1/6 



l^s(u)| 2 

1/b . „ ..I Fa s {u)\ 2 , 

-1/6 



1/6 1 ^ h ) 1/2 )i 



l^(u)| 2 

7) - ^ e 



I ft MITT ( M2 l<^,™( u ) ~ VeHI 2 1 \-n 

|^jt(6«)|| -Fa s (u)| — ; — t-Vtt; du ) 1 



0(1) 



1/b |-Fa.(u)| s 



-1/6 



dw 



I log 61 1 3/2 

=o(i)(r wl vi), 



Vb i^-^m|^, TO (u) - (p e ( M )| 2 dul + o(l)(&- 2 ^ +1 V 1) 



-i/b 



where o(l) is a null sequence which does not depend on b. Consequently, 



sup 

bEB 



1/6 



1/6 



\<p K (bu)\ 



\fas(u)\ 2 y/i 

\MU)\ 2 



a b,s,2 



(i^ 1/2 Al) = o(l). 
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We get 

3^,2 = (1 + ° P (l))Ke,2 + o{b-f 3+l ' 2 V 1)) = a b , £ , 2 + opib-^ 1 ' 2 V 1), (59) 

where the last estimate follows from a 2 2 < 6~ 2,3+1 V 1 by the analysis of the convergence rates. 

bmcc a b e i < land cr Mj2 < 6^ +1 / 2 Vl,it remains to combine (58) and (59) to obtain uniformly 
in B 

1 -1/2— ~ 1-1 
= m ' — 



0V,i°V,2 = ^-™ _1/2 ((TM,i+op(logn))(a b , ei 2 + op(6- /3+1/2 Vl)) 



= <7fc, e + op ((log n)™- 1 / 2 ^-^ 1 / 2 VI)). □ 
5.2.2. Proof 0/ Theorem 3.2 

As seen in error decomposition (39), there are three stochastic errors Vb,x,V bte and Vb. c which 
were treated in Lemma 5.5. This motivates the following definition. For Si > let 



Sb,x ■= (1 + <Si)v / 21oglogn max ct m ,x, Sb,e ■= (Si log n) 3 max cr^. 

Note that on the assumption |< / 9 £ (u)| > (l + |u|) _/3 wc obtain for a b , e = ■^ r mr 1 / 2 ab,e,i< 7 b.£,2, defined 
in (45), that 



'L,2 > 



1/6 ,-1/b 

I J-a s (-u)| 2 (l + \u\fP du> (1 + M) 2/3 ~ 2 du ~ 6- 2/3+1 V 1 



-1/6 J-l/b 

and thus o-;, je > m~ 1 ^ 2 (b~ l3+1 ^ 2 V 1). Therefore, Lemma 5.5 yields 



P (3b G B : \V b ,x + Vb.e + Vb. c \ > S fc , x + Si 



^b,e\ , rl /'- 1L p I t r 1 ^ "Sb,; 



6,e 

^P(36 6 B : \V b .x\ > S b:X + ISb.e) + P(3beB: \V b , s \ > + P{3b G B : \V b , c \ > ^ 

=o(l). 

Hence, the probability of the event 

M := |\/6 G B : \V b>X + V b>e + V b .c\ ^ S b .x + S b , e } 
converges to one. The variances S b ,x and Sb. £ can be estimated by 



Sb,x ■= (1 + Si) \J 2 log log n max ct^x, S b . e ■= (Si log n) 3 max cr Mj£ . 

We apply Lemmas 5.7 and 5.8 and the triangle inequality of the £°°-norm to obtain uniformly in 
beB 

I max (7 x - max er„x | ^ max|oy x -<r„ x| = Qpf rFTFTir^m » ^ 
/^b p ' p^b p p ' p ' Vto 1 / 2 (6p^ 1 / 2 A 1) 

~ ii~ / logn \ 

maxo- M , e - maxtJue ^ max o\, e - a u ^\ = o P [ , TWT~r\ • 
^b ^ n^b n^b ^' ^ \m 1 /2( /3-i/2 A iy 

Using again a b , E ^ m _1 / 2 (o _/3+1 / 2 VI), we thus obtain for all 77 > that the event 

A 2 :={V6 G B : \(S b>x + S b , c ) - (S b , x + S b , e )\ v(S b .x + S b , e )} 

has asymptotically probability one. The same holds true for the events 

A 3 :=\vbeB: sup |/ 6 (g*) - f(q T )\^ t]f(q r )}, 

1 q*e[{q T f\q T , b )y(q T /\q T , b )] ' 
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A 4 :={V6 G B : sup \f b (q*) - J b {q T ,b)\ < r,\J b {q T , b )\) 

L q*£[(q r Aq T , b )V(q T A.q r , t )] J 

by (48). Therefore, it is sufficient to work in the following on the event 

A :=A 1 n A 2 n A 3 n A 4 . 

We now show that the adaptive estimator q T mimics the oracle estimator defined as follows. 
Recalling the estimate of the bias Bb = Db a+1 , let the oracle bandwidth be defined by 

K := max{6 eB:B b ^ S b , x + S b>e }. (60) 

Note that 6* is well-defined and unique since B b is monoton increasing in b while (S bf x + Sb,s) is 
monton decreasing. We get the oracle estimator q T , b , ■ 

Since on A4 for all b € B and q* € [{q T A <f T) &) V (g T A q T , b )] 

\fb(q*)\ > \fb(qr,b)\ - \fb(q*) - fb(qr,b)\ > (1 - V)\fb&r,b)\, 

we have for any b £ B on the event A\ H by (39) 

B 6 + \V b ,x + Vh. e + V b ,c\ / B b + S b .x + S b>e 



2|/ 6 (g*)| 2(1-^)1/6(^,6)1 
Furthermore, by the definition of &* we have on the event A for any b ^ 6* 

S b x + S be 



\q T ,b - qr 



(l-V)\Mqr,b)\ 



On A2 we can estimate S bl x + S b>e ^ (1 — r/)(S b , x + S blE ). In particular, we have on the event ^4 
for any ^ 6* 

|<2V,6 - <Zt| < 



(l-»7) 2 |/6(9r,6)| 



Since for any <5 > we find 61,77 > such that ((I - ?y)~ 2 (\/2 + 5t) - Vf) V ((1 - ?7)~ 2 <5i) < 6, 
we obtain 1^6 — q T \ ^ Ej, with Y. b as defined in (13). As a result one has q T £ U b and g r € W M 
for all 6^6, and /j ^ &*, implying U^r\U b ^ 0. By the definition of the procedure, 6* &* and 
n U br 7^ on the event A. This leads to 

\q r $, - q T \ <|<fr,6» -q r \ + \q T $. - <Zt,6,| < £&» + + %) 

On A 2 H A 3 we have < S btX + S bi£ since /((? T ) ^ r. Using additionally the monotonicity of 
(S b ,x + S b ,e) as well as b ^ &*, this implies 

\q Trb , - q T \ < (S K ,x + S b „ e ) < (0oglogn+ (logn 5 ) 3 ) (b7^ +1/2 V l)^ 1 ' 2 . 

It remains to note by the definition (60) of the oracle b t and by the assumption bj+i/bj < I that 

K ~ n -l/(2a+2(^Vl/2)+l) as n ^ □ 



References 

Carroll, R. J. and Hall, P. (1988). Optimal rates of convergence for deconvolving a density. 
Journal of the American Statistical Association 83 1184-1186. 

Carroll, R. J., Ruppert, D., Stefanski, L. A. and Crainiceanu, C. M. (2006). Measure- 
ment error in nonlinear models: a modern perspective 105. Chapman & Hall/CRC. 



/. Dattner, M. Reiji and M. Trabs/Quantile estimation in deconvolution 



3.3 



Comte, F. and LACOUR, C. (2011). Data-driven density estimation in the presence of additive 
noise with unknown distribution. Journal of the Royal Statistical Society: Series B ( Statistical 
Methodology) 73 601-627. 

Dattner, I., Goldenshluger, A. and Juditsky, A. (2011). On deconvolution of distribution 
functions. Ann. Statist. 39 2477-2501. 

Dattner, I. and Reiser, B. (2013). Estimation of distribution functions in measurement error 
models. J. Statist. Plann. Inference 143 479-493. 

Delaigle, A., Hall, P. and Meister, A. (2008). On deconvolution with repeated measure- 
ments. Ann. Statist. 36 665-685. 

Dudley, R. M. (1992). Frechet Differentiability, p- Variation and Uniform Donsker Classes. The 
Annals of Probability 20 1968-1982. 

Fan, J. (1991). On the Optimal Rates of Convergence for Nonparametric Deconvolution Problems. 
Ann. Statist. 19 1257-1272. 

Frese, E. M., Fick, A. and Sadowsky, H. S. (2011). Blood pressure measurement guidelines 
for physical therapists. Cardiopulmonary physical therapy journal 22 5. 

Girardi, M. and Weis, L. (2003). Operator-valued Fourier multiplier theorems on Bcsov spaces. 
Mathematische Nachrichten 251 34-51. 

Goldenshluger, A. and Nemirovski, A. (1997). On spatially adaptive estimation of nonpara- 
metric regression. Mathematical Methods of Statistics 6 135-170. 

Hall, P. and Lahiri, S. N. (2008). Estimation of distributions, moments and quantiles in de- 
convolution problems. Ann. Statist. 36 2110-2134. 

Johannes, J. (2009). Deconvolution with unknown error distribution. Ann. Statist. 37 2301- 
2323. 

Johannes, J. and Schwarz, M. (2013). Adaptive circular deconvolution by model selection 

under unknown error distribution. Bernoulli. To appear. 
Kannel, W. et al. (1995). Framingham study insights into hypertensive risk of cardiovascular 

disease. Hypertension research: official journal of the Japanese Society of Hypertension 18 181— 

196. 

Kappus, J. (2012). Nonparametric adaptive estimation for discretely observed Levy 
processes. Available at http://edoc.hu-berlin.de/disscrtationcn/kappus-julia-johanna-2012-09- 
28/PDF/kappus.pdf. 

Lepskii, O. V. (1990). A problem of adaptive estimation in Gaussian white noise. Teor. Veroy- 
atnost. i Primenen. 35 459-470. 

Lounici, K. and Nickl, R. (2011). Global uniform risk bounds for wavelet deconvolution esti- 
mators. Ann. Statist. 39 201-231. 

Massart, P. (2007). Concentration inequalities and model selection. Lecture Notes in Mathemat- 
ics 1896. Springer, Berlin. 

McMurry, T. L. and Politis, D. N. (2004). Nonparametric regression with infinite order flat- 
top kernels. Journal of Nonparametric Statistics 16 549-562. 

Meister, A. (2004). On the effect of misspecifying the error density in a deconvolution problem. 
Canadian Journal of Statistics 32 439-449. 

Neumann, M. H. (1997). On the effect of estimating the error density in nonparametric decon- 
volution. Journal of Nonparametric Statistics 7 307-330. 

Neumann, M. H. (2007) . Deconvolution from panel data with unknown error distribution. Journal 
of Multivariate Analysis 98 1955-1968. 

Neumann, M. H. and Reiss, M. (2009). Nonparametric estimation for Levy processes from 
low-frequency observations. Bernoulli 15 223-248. 

Nickl, R. and Reiss, M. (2012). A Donsker theorem for Levy measures. J. Fund. Anal. 263 
3306-3332. 

Rosenthal, H. P. (1970). On the subspaces of LP (p > 2) spanned by sequences of independent 

random variables. Israel J. Math. 8 273-303. 
Sohl, J. and Trabs, M. (2012). A uniform central limit theorem and efficiency for deconvolution 

estimators. Electron. J. Statist. 6 2486-2518. 
Spokoiny, V. and Vial, C. (2009). Parameter tuning in pointwise adaptation using a propagation 



/. Dattner, M. Reifi and M. Trabs/Quantile estimation in deconvolution 



31 



approach. Ann. Statist. 37 2783-2807. 
Stirnemann, J. J., Comte, F. and SAMSON, A. (2012). Density estimation of a biomedical 
variable subject to measurement error using an auxiliary set of replicate observations. Statistics 
in Medicine. 

Triebel, H. (2010). Theory of Function Spaces. Birkhauser Verlag, Basel Reprint of the 1983 
Edition. 

van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge University Press, Cambridge. 



